NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

WelQrate: Defining the Gold Standard in Small Molecule Drug Discovery Benchmarking

Liu, Yunchao; Dong, Ha; Wang, Xin; Moretti, Rocco; Wang, Yu; Su, Zhaoqian; Gu, Jiawei; Bodenheimer, Bobby; Weaver, Charles; Meiler, Jens; et al (November 2024, Advances in Neural Information Processing Systems (NeurIPS))

Full Text Available
Interpretable Chirality-Aware Graph Neural Network for Quantitative Structure Activity Relationship Modeling in Drug Discovery

https://doi.org/10.1609/aaai.v37i12.26679

Liu, Yunchao; Wang, Yu; Vu, Oanh; Moretti, Rocco; Bodenheimer, Bobby; Meiler, Jens; Derr, Tyler (June 2023, Proceedings of the AAAI Conference on Artificial Intelligence)

In computer-aided drug discovery, quantitative structure activity relation models are trained to predict biological activity from chemical structure. Despite the recent success of applying graph neural network to this task, important chemical information such as molecular chirality is ignored. To fill this crucial gap, we propose Molecular-Kernel Graph NeuralNetwork (MolKGNN) for molecular representation learning, which features SE(3)-/conformation invariance, chirality-awareness, and interpretability. For our MolKGNN, we first design a molecular graph convolution to capture the chemical pattern by comparing the atom's similarity with the learnable molecular kernels. Furthermore, we propagate the similarity score to capture the higher-order chemical pattern. To assess the method, we conduct a comprehensive evaluation with nine well-curated datasets spanning numerous important drug targets that feature realistic high class imbalance and it demonstrates the superiority of MolKGNN over other graph neural networks in computer-aided drug discovery. Meanwhile, the learned kernels identify patterns that agree with domain knowledge, confirming the pragmatic interpretability of this approach. Our code and supplementary material are publicly available at https://github.com/meilerlab/MolKGNN.
more » « less
Full Text Available
Time-resolved live-cell spectroscopy reveals EphA2 multimeric assembly

https://doi.org/10.1126/science.adg5314

Shi, Xiaojun; Lingerak, Ryan; Herting, Cameron J; Ge, Yifan; Kim, Soyeon; Toth, Paul; Wang, Wei; Brown, Benjamin P; Meiler, Jens; Sossey-Alaoui, Khalid; et al (December 2023, Science)

Ephrin type-A receptor 2 (EphA2) is a receptor tyrosine kinase that initiates both ligand-dependent tumor-suppressive and ligand-independent oncogenic signaling. We used time-resolved, live-cell fluorescence spectroscopy to show that the ligand-free EphA2 assembles into multimers driven by two types of intermolecular interactions in the ectodomain. The first type entails extended symmetric interactions required for ligand-induced receptor clustering and tumor-suppressive signaling that inhibits activity of the oncogenic extracellular signal–regulated kinase (ERK) and protein kinase B (AKT) protein kinases and suppresses cell migration. The second type is an asymmetric interaction between the amino terminus and the membrane proximal domain of the neighboring receptors, which supports oncogenic signaling and promotes migration in vitro and tumor invasiveness in vivo. Our results identify the molecular interactions that drive the formation of the EphA2 multimeric signaling clusters and reveal the pivotal role of EphA2 assembly in dictating its opposing functions in oncogenesis.
more » « less
Full Text Available
IHMCIF: An Extension of the PDBx/mmCIF Data Standard for Integrative Structure Determination Methods

https://doi.org/10.1016/j.jmb.2024.168546

Vallat, Brinda; Webb, Benjamin M; Westbrook, John D; Goddard, Thomas D; Hanke, Christian A; Graziadei, Andrea; Peisach, Ezra; Zalevsky, Arthur; Sagendorf, Jared; Tangmunarunkit, Hongsuda; et al (March 2024, Journal of Molecular Biology)

Full Text Available
Allele-specific activation, enzyme kinetics, and inhibitor sensitivities of EGFR exon 19 deletion mutations in lung cancer

https://doi.org/10.1073/pnas.2206588119

Brown, Benjamin P.; Zhang, Yun-Kai; Kim, Soyeon; Finneran, Patrick; Yan, Yingjun; Du, Zhenfang; Kim, Jiyoon; Hartzler, Abigail Leigh; LeNoue-Newton, Michele L.; Smith, Adam W.; et al (July 2022, Proceedings of the National Academy of Sciences)

Oncogenic mutations within the epidermal growth factor receptor (EGFR) are found in 15 to 30% of all non–small-cell lung carcinomas. The term exon 19 deletion (ex19del) is collectively used to refer to more than 20 distinct genomic alterations within exon 19 that comprise the most common EGFR mutation subtype in lung cancer. Despite this heterogeneity, clinical treatment decisions are made irrespective of which EGFR ex19del variant is present within the tumor, and there is a paucity of information regarding how individual ex19del variants influence protein structure and function. Herein, we identified allele-specific functional differences among ex19del variants attributable to recurring sequence and structure motifs. We built all-atom structural models of 60 ex19del variants identified in patients and combined molecular dynamics simulations with biochemical and biophysical experiments to analyze three ex19del mutations (E746_A750, E746_S752 > V, and L747_A750 > P). We demonstrate that sequence variation in ex19del alters oncogenic cell growth, dimerization propensity, enzyme kinetics, and tyrosine kinase inhibitor (TKI) sensitivity. We show that in contrast to E746_A750 and E746_S752 > V, the L747_A750 > P variant forms highly active ligand-independent dimers. Enzyme kinetic analysis and TKI inhibition experiments suggest that E746_S752 > V and L747_A750 > P display reduced TKI sensitivity due to decreased adenosine 5′-triphosphate K m . Through these analyses, we propose an expanded framework for interpreting ex19del variants and considerations for therapeutic intervention.
more » « less
Full Text Available
CACHE Challenge #1: Targeting the WDR Domain of LRRK2, A Parkinson’s Disease Associated Protein

https://doi.org/10.1021/acs.jcim.4c01267

Li, Fengling; Ackloo, Suzanne; Arrowsmith, Cheryl H; Ban, Fuqiang; Barden, Christopher J; Beck, Hartmut; Beránek, Jan; Berenger, Francois; Bolotokova, Albina; Bret, Guillaume; et al (November 2024, Journal of Chemical Information and Modeling)

The CACHE challenges are a series of prospective benchmarking exercises to evaluate progress in the field of computational hit-finding. Here we report the results of the inaugural CACHE challenge in which 23 computational teams each selected up to 100 commercially available compounds that they predicted would bind to the WDR domain of the Parkinson’s disease target LRRK2, a domain with no known ligand and only an apo structure in the PDB. The lack of known binding data and presumably low druggability of the target is a challenge to computational hit finding methods. Of the 1955 molecules predicted by participants in Round 1 of the challenge, 73 were found to bind to LRRK2 in an SPR assay with a KD lower than 150 μM. These 73 molecules were advanced to the Round 2 hit expansion phase, where computational teams each selected up to 50 analogs. Binding was observed in two orthogonal assays for seven chemically diverse series, with affinities ranging from 18 to 140 μM. The seven successful computational workflows varied in their screening strategies and techniques. Three used molecular dynamics to produce a conformational ensemble of the targeted site, three included a fragment docking step, three implemented a generative design strategy and five used one or more deep learning steps. CACHE #1 reflects a highly exploratory phase in computational drug design where participants adopted strikingly diverging screening strategies. Machine learning-accelerated methods achieved similar results to brute force (e.g., exhaustive) docking. First-in-class, experimentally confirmed compounds were rare and weakly potent, indicating that recent advances are not sufficient to effectively address challenging targets.
more » « less
Full Text Available
Integrating linear optimization with structural modeling to increase HIV neutralization breadth

Sevy, Alexander; Panda, Swetasudha; Crowe, James E; Meiler, Jens; Vorobeychik, Yevgeniy (January 2018, PLOS computational biology)

Computational protein design has been successful in modeling fixed backbone proteins in a single conformation. However, when modeling large ensembles of flexible proteins, current methods in protein design have been insufficient. Large barriers in the energy landscape are difficult to traverse while redesigning a protein sequence, and as a result current design methods only sample a fraction of available sequence space. We propose a new computational approach that combines traditional structure-based modeling using the ROSETTA software suite with machine learning and integer linear programming to overcome limitations in the ROSETTA sampling methods. We demonstrate the effectiveness of this method, which we call BROAD, by benchmarking the performance on increasing predicted breadth of anti-HIV antibodies. We use this novel method to increase predicted breadth of naturally-occurring antibody VRC23 against a panel of 180 divergent HIV viral strains and achieve 100% predicted binding against the panel. In addition, we compare the performance of this method to state-of-the-art multistate design in ROSETTA and show that we can outperform the existing method significantly. We further demonstrate that sequences recovered by this method recover known binding motifs of broadly neutralizing anti-HIV antibodies. Finally, our approach is general and can be extended easily to other protein systems.
more » « less
Full Text Available
Structure–function analysis of oncogenic EGFR Kinase Domain Duplication reveals insights into activation and a potential approach for therapeutic targeting

https://doi.org/10.1038/s41467-021-21613-6

Du, Zhenfang; Brown, Benjamin P.; Kim, Soyeon; Ferguson, Donna; Pavlick, Dean C.; Jayakumaran, Gowtham; Benayed, Ryma; Gallant, Jean-Nicolas; Zhang, Yun-Kai; Yan, Yingjun; et al (March 2021, Nature Communications)

Abstract Mechanistic understanding of oncogenic variants facilitates the development and optimization of treatment strategies. We recently identified in-frame, tandem duplication ofEGFRexons 18 - 25, which causes EGFR Kinase Domain Duplication (EGFR-KDD). Here, we characterize the prevalence ofERBBfamily KDDs across multiple human cancers and evaluate the functional biochemistry of EGFR-KDD as it relates to pathogenesis and potential therapeutic intervention. We provide computational and experimental evidence that EGFR-KDD functions by forming asymmetric EGF-independent intra-molecular and EGF-dependent inter-molecular dimers. Time-resolved fluorescence microscopy and co-immunoprecipitation reveals EGFR-KDD can form ligand-dependent inter-molecular homo- and hetero-dimers/multimers. Furthermore, we show that inhibition of EGFR-KDD activity is maximally achieved by blocking both intra- and inter-molecular dimerization. Collectively, our findings define a previously unrecognized model of EGFR dimerization, providing important insights for the understanding of EGFR activation mechanisms and informing personalized treatment of patients with tumors harboring EGFR-KDD. Finally, we establishERBBKDDs as recurrent oncogenic events in multiple cancers.
more » « less
Ensuring scientific reproducibility in bio-macromolecular modeling via extensive, automated benchmarks

https://doi.org/10.1038/s41467-021-27222-7

Koehler Leman, Julia; Lyskov, Sergey; Lewis, Steven M.; Adolf-Bryfogle, Jared; Alford, Rebecca F.; Barlow, Kyle; Ben-Aharon, Ziv; Farrell, Daniel; Fell, Jason; Hansen, William A.; et al (December 2021, Nature Communications)

Abstract Each year vast international resources are wasted on irreproducible research. The scientific community has been slow to adopt standard software engineering practices, despite the increases in high-dimensional data, complexities of workflows, and computational environments. Here we show how scientific software applications can be created in a reproducible manner when simple design goals for reproducibility are met. We describe the implementation of a test server framework and 40 scientific benchmarks, covering numerous applications in Rosetta bio-macromolecular modeling. High performance computing cluster integration allows these benchmarks to run continuously and automatically. Detailed protocol captures are useful for developers and users of Rosetta and other macromolecular modeling tools. The framework and design concepts presented here are valuable for developers and users of any type of scientific software and for the scientific community to create reproducible methods. Specific examples highlight the utility of this framework, and the comprehensive documentation illustrates the ease of adding new tests in a matter of hours.
more » « less
Full Text Available
Federating Structural Models and Data: Outcomes from A Workshop on Archiving Integrative Structures

https://doi.org/10.1016/j.str.2019.11.002

Berman, Helen M.; Adams, Paul D.; Bonvin, Alexandre A.; Burley, Stephen K.; Carragher, Bridget; Chiu, Wah; DiMaio, Frank; Ferrin, Thomas E.; Gabanyi, Margaret J.; Goddard, Thomas D.; et al (December 2019, Structure)

Full Text Available

Search for: All records